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Abstract 

Dispersal affects the distribution, dynamics and genetic structure of natural populations, and can be significantly different 
between sexes. However, literature records dealing with the dispersal of migratory birds are scarce, as migratory behaviour 
can notably complicate the study of dispersal. We used the barn swallow HIrundo rustica as model taxon to investigate 
patterns of genetic variability in males and in females of a migratory species showing sex-biased dispersal. We collected 
blood samples {n = 1 86) over the period 2006 to 201 1 from adults (H. r. rustica subspecies) nesting in the same breeding site 
at either high (Ireland, Germany and Russia) or low (Spain, Italy and Cyprus) latitude across Europe. We amplified the 
Chromo Helicase DNA gene in all birds in order to warrant a sex-balanced sample size (92 males, 94 females). We 
investigated both uniparental (mitochondrial ND2 gene) and biparental (microsatellite DNA: 10 loci) genetic systems. The 
mtDNA provided evidence for demographic expansion yet no significant partition of the genetic variability was disclosed. 
Nevertheless, a comparatively distant Russian population investigated in another study, whose sequences were included in 
the present dataset, significantly diverged from all other ones. Different to previous studies, microsatellites highlighted 
remarkable genetic structure among the studied populations, and pointed to the occurrence of differences between male 
and female barn swallows. We produced evidence for non-random patterns of gene flow among barn swallow populations 
probably mediated by female natal dispersal, and we found significant variability in the philopatry of males of different 
populations. Our data emphasize the importance of taking into account the sex of sampled individuals in order to obtain 
reliable inferences on species characterized by different patterns of dispersal between males and females. 
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Introduction 

Distribution, dynamics and genetic structure of natural popu- 
lations can be severely affected by dispersal [1-3], i.e. the 
movement of an organism from its birthplace to its first breeding 
site (natal dispersal) or from one breeding site to another (breeding 
dispersal) [4—5]. Dispersal may be significandy different between 
sexes, as has been well documented in birds and mammals [3,4,6]. 
The costs/benefits of such asymmetry usually depend on the life 
history and the mating system of a given species [7] . In particular, 
dispersal tends to be female-biased in birds and male-biased in 
mammals [4,8,9] . Given that dispersal can significantly affect gene 
flow among populations [10], the dispersing sex may appear as 
genetically less structured. Hence, accounting for sex is funda- 
mental not only to find out potential differences in dispersal [10- 
1 1] but also to rehably infer the genetic structure of populations, as 
the latter could be driven mainly by the phUopatric sex [12]. 



The estimate of the dispersal rate in natural populations is often 
incomplete because it usually requires direct methods with 
intensive, large-scale and long-term demographic studies 
[10,11,13]. However, recent advances in the genetic techniques 
allowed researchers to integrate field investigation with molecular 
DNA analysis. The combined use of markers with different way of 
inheritance (mitochondrial versus nuclear DNA) represented the 
most suitable approach to infer discrepancy between the dispersal 
pattern of males and females. Indeed, differences in the genetic 
picture drawn by mitochondrial and nuclear markers are expected 
when sex-biased dispersal occurs [11]. Studies focusing on the 
apphcation of population genetic tools to infer sex-biased dispersal 
are well known for vertebrates, more frequentiy in birds and 
mammals (e.g.: eiders [14], rodents [15]) than in amphibians and 
reptiles (e.g.: frogs [16], turtles [17]). Nevertheless, as exhaustively 
discussed by MoUer et al. [18], dispersal of migratory birds is poorly 
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studied because routes can greatly complicate the interpretation of 
the genetic scenario [19]. 

The bam swallow Hirundo mstica is a polyt)'pic passerine bird 
widely distributed throughout most of the northern hemisphere 
[20]. This species is extensively studied with reference to its 
morphology and behaviour. For instance, recent studies disclosed 
significant patterns of morphological difierentiation among Euro- 
pean populations in a few characters (ventral coloration, tail 
streamers) known to be under sexual selection [21-24]. As far as 
the migratory behaviour is concerned, European barn swallows 
can be divided in two main groups: one breeds in south-western 
Europe and winters in central and western Africa, the other breeds 
in northern Europe and winters in southern Africa [25]. 
Differences in morphological and behavioural traits notwithstand- 
ing, the occurrence of some degree of genetic differentiation 
among European H. mstica populations has never been proved by 
using either mitochondrial or microsatellite DNA markers 
[23,24,26]. However, although the barn swallow is a female- 
biased dispersal species (msdes are more phUopatric than females, 
e.g. [27]), the sex of the investigated individuals has never been 
taken into account in any genetic study focusing on this taxon. 

In this work we aim at: (i) analysing the genetic variability of 
European barn swallow populations over a wide sampling area by 
means of markers from both uniparental (mitochondrial DNA: 
mtDNA) and biparental (microsatellite DNA) genetic systems; (ii) 
testing the consequences of sex-biased dispersal on the population 
genetic structure by comparing patterns of variation in a well 
balanced sample of males and females. Overall, the barn swallow 
represents an excellent model among migratory species to 
investigate patterns of genetic variation in males and females. 
While lack of genetic structure is expected for the whole sample 
size, we predict the occurrence of different genetic pattern between 
male and female bam swallows [12,16]. 

Materials and Methods 

Ethics statement 

The barn swallow is not an endangered species in all trapping 
areas of this study. Samples were obtained in the same place (six 
localities) in different years. Adults were trapped with mist-nets. 
Samples (one blood droplet) were collected by means of wing 
venipuncture (brachial/radial/ulnar vein). Birds were not sedated 
and did not suffer any injury: all of them were released 10 min 
after blood collection. 

We report here below the coordinates of the sampling localities 
(see also Table S 1 ) together with the information about the permits 
issued for each specific area: (1) TuUynisk, Offaly (Ireland: 
53°07'N, 07°54'W). The licensing authority in Ireland is die 
National Parks and Wildlife Service (NPWS), which provided an 
annual (renewable) license for the present study (NPWS references 
57/2009 and C41/2010). Separately, the capture of birds is also 
controlled through the British Trust for Ornithology' Ringing 
Scheme. AU licenses and permits were obtained by, and in the 
name of, A.S. Copland (BTO permit number A51 15). All samples 
were taken from birds at a privately-owned site. Access to this site 
was arranged through the regional staff' of the NPWS, who also 
have contact details for the owner/manager; (2) Itzehoe 
(Germany: 53°56'N, 09°3rE). Samples were collected by S. 
Martens, who is ringer at the Institute for Avian Research 
"Vogelwarte Helgoland". Samples were collected at a private farm 
and future permissions should be requested to the owner of the 
same; (3) Lake Ladoga (Russian Federation: 60°40'N, 32°56'E). 
Samples were collected by O. Babushkina at the Ladoga 
Ornithological Station of the Biological Research Institute of the 



Saint Petersburg State University. The Ladoga Omithological 
Station is comprised within the Nizhne-Svirsky State Reserve. AU 
work at the station adhered to the current legislation of the 
Russian Federation and to the institutional guidelines of the State 
University of Saint Petersburg. No specific permits were needed to 
O. Babushkina. Future permissions should be requested to the 
Biological Research Institute of the Saint Petersburg State 
University; (4) Gorliz (Spain: 43°24'N, 02°57'W). Samples were 
collected by I. Zuberogoitia. He obtained a license to trap barn 
swallows and collect blood samples by the Department of Medio 
Ambiente (Diputacion Foral dc Bizkaia). The same administration 
manages the experimental farm for the selection of cow races 
where the sampling was carried out, and all permissions were 
obtained in the same site. Future permissions should be requested 
to the same Department; (5) Orti-Bottagone Nature Reserve (Italy: 
42°57'N, 10°35'E). Samples were collected by R. CecchereUi 
(veterinary, CRUMA, Leghorn). Land is a private property (oasis) 
of World Wildlife Fund (WWF) and future permissions should be 
requested to WWF Italy; (6) Polls (Cyprus: 35°02'N, 32°25'E). 
Samples were collected in a govemment land by A. Crabtree (Vice 
Chairman and Ringing Officer of BirdLife Cyprus), who obtained 
a specific permit from the competent authority of Cyprus (Game & 
Fauna Service, Ministry of Interior, Nicosia). Future permissions 
should be requested to the same Ministry. 

Biological sampling 

Blood samples («= 186) were collected in subsequent years 
(2006-2011) from nesting adults [H. r. mstica subspecies) at the 
same breeding site in six areas across Europe (Figure 1, Table SI): 
Gorliz, Spain (SPA, n = 32); TuUynisk, Ireland (IRE, n = 26); Orti- 
Bottagone Nature Reserve, Italy (ITA, n = 33); Itzehoe, Germany 
(GER, n = 33); Polls, Cyprus (GYP, n = 32); Lake Ladoga, Russia 
(RUS, n = 30). One blood droplet was collected on filter paper 
(Whatman, UK) from each bird by wing venipuncture. Each 
swallow was ringed and sexed according to a PCR-based assay (see 
below). 

DNA extraction 

Genomic DNA was extracted using the Puregene Core Kit-A 
(Qiagen, Germany) following the manufacturer's instmctions. 
DNA content and purity were determined with an Eppendorf 
BioPhotometer (AG Eppendorf, Germany). 

Sexing 

Chromo Helicase DNA (CHD) gene of ZZ (males) and ZW 
(females) sexual chromosomes was amplified with primers L1237 
(5'-GAGAAACTGTGCAAAACA-3') and HI 272 (5'-TCCA- 
GAATATCTTCTGCTCC-3') previously tested in other bird 
species [28]. PCRs were prepared in 25 |iL as in [29] and 
performed in a MyCycler thermal cycler (Biorad, USA) with the 
following profUe: 3 min at 94°C, 30 cycles of 30 s at 94°C, 1 min 
at 48°C and 45 s at 72°C, finally 7 min at 72°C. PGR products 
were run in a 3.5% agarose gel for 60 min together with positive 
controls for male and female individuals. We set-up the PCR- 
based procedure by testing 20 barn swallows whose sex (10 males, 
10 females) was determined through the inspection of standard 
morphological traits [30] by one of us (P.M. Politi). 

Mitochondrial DNA 

Laboratory procedure. The entire mtDNA gene codifying 
for the second sub-unit of the NADH dehydrogenase (ND2, 
1041 bp) was amplified using primers L5216 and H6313 [31]. 
PCRs (50 nL) were mn in a MyCycler thermal cycler (Biorad) as 
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Figure 1. Barn swallow breeding populations sampled in this study. (1): SPA, Spain; (2): IRE, Ireland; (3): ITA, Italy; (4): GER, Germany; (5): GYP, 
Cyprus; (6): RUS, Russia. Mitochondrial DNA sequences available in the GenBank were obtained from Russian populations of Krasnodar {7, KRD) and 
Medvedevo (8, MED) (Table SI). 
doi:1 0.1 371 /journal.pone.0098574.g001 



in [29] . PGR products were purified using GenElute PGR Glean- 
up Kit (Sigma Aldrich, Italy) and directly sequenced on both DNA 
strands using the BigDye Terminator v. 3.1 Cycle Sequencing Kit 
on an ABI 3730 DNA automated sequencer (Applied Biosystems, 
USA) at Genechron (Rome, Italy). We amplified the ND2 gene in 
a subset of samples (10 for each population, « = 60), and we 
included in the alignment (clustalx v. 1.81: [32]) 16 sequences 
from the GenBank (Russia: Krasnodar Kray, KRD, n = 7 and 
Arkhangel'.sk Oblast', MED, n = 9: [33]) (Figure 1, Table SI). 

Population genetic inferences. We used dnasp v. 5.1 [34] 
to infer the mtDNA haplotypes. A network was constructed using 
DNA ALIGNMENT V. 1.3.3.1 (2003-2013 Pluxus Technology, UK) 
and the Median Joining method of [35] with kei work v. 4.5.1.0 
(2004-2009 Fluxus Technology). We used areequin v. 3.5.1 [36] 
to: (i) calculate the haplotype diversity (A), the nucleotide diversity 
(7t) and the mean number of pairwise differences (k); (n) investigate 
the partition of the mtDNA diversity (Analysis of the Molecular 
Variance, amova) among and within the populations using the 
Phisx analogous to Wright's i<"-statistics (1000 permutations) [37]; 
(iii) compute the average genetic distance among populations 
(1000 replicates witii the TN93 algoritiim) [38]. 

Historical demography. Inferences of historical demogra- 
phy were obtained using dnasp and different statistics as described 
in [39]. The analysis included (i) males and females plus the 
GenBank sequences (n = 60-1-16 — 76), (ii) only the males (n = 27, no 
GenBank entries), and (iii) only the females [n — 33, no GenBank 
entries). Ramirez-Soriano et al. [40] investigated the statistical 
power of a wide range of statistics computed on DNA 
polymorphism data in detecting a sudden population expansion, 
a sudden contraction or a bottieneck. They found that the most 
powerful tests were those based on haplotype frequencies, 
including the of Fu [41] and the i?2 statistic [42]. In this study, 
the significance of the and i?2 statistics was investigated by 
examining the null distribution of 5000 coalescence simulations 
using DNASP. Only significant negative and positive i?2 values 



were retained as evidence of population expansion [39]. We also 
computed the Tajima's D [43] . Nevertheless, [42] reported that 
statistic has a greater power than the Tajima's D or Fs, to detect 
population expansion when the sample size is small (~10). 
Furthermore, the McDonald-Kreitman test [44] as implemented 
in DNASP was conducted for the entire dataset to investigate the 
deviation from an equal ratio of non-synonymous (KJ to 
synonymous (K^) fix(-d substitutions. Specifically for this test we 
used two US H. r. erythrogaster samples as outgroup (UWBM 78832 
and UWBM 80547 from the University of Washington Burke 
Museum of Natural History, Seattie, USA; GenBank accession 
codes: HF548593-94). 

The Mismatch Distribution (MD) of mtDNA pairwise differ- 
ences was also examined using areequin (males + females, males 
only, females only). The more ragged the shape of the distribution 
the closer was the population to a stationary model of constant size 
over a long period (raggedness index, r) [45] . The MD test uses the 
observed parameters of the expansion to perform coalescent 
simulations and to create new estimates of the same parameters. 
Departure from a model of sudden expansion was tested for each 
population by summing the squared differences (SSD) between 
observed and estimated MD [46,47]. 

Microsatellite DNA 

Laboratory procedure. All samples (n=186) were investi- 
gated at 10 loci of the microsatellite DNA (Short Tandem Repeats, 
STR) reported in [48,49]. PGRs (12.5 ^lL) were performed as in 
[50] (Table 1). Gene sizing was carried out at the Research Centre 
of Clinical and Molecular Genetics (Pisa, Italy) on an ABI Prism 
3730 DNA automated sequen[:er using GEKESCW (Applied 
Biosystems). For the statistical analyses we used either the whole 
sample size («=186) or males (k = 92) and females (re = 94) 
separately. 

Genetic variability and relatedness. The discriminatory 
power of the whole set of STR loci was evaluated with gimlet v. 
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1.3.3 [51] by estimating the probability that two individuals drawn 
at random from the populations showed identical multilocus 
s; S genotypes by chance (Pid and Piu sib: for the latter, we assumed 

^ sibling relationships) [52,53]. Moreover, all loci were investigated 

I ^ using MICRO-CHECKER V. 2.2.3 [54] to check for null alleles, allele 

dropout and scoring errors due to stuttering. Arlequin, fstat v. 
2.9.3 [55] and genepop v. 3.4 [56] were used in order to: (i) 
compute the number of alleles per locus, the number of unique 
alleles and the allelic richness; (ii) calculate expected (//e) and 
observed {Ho) heterozygosity; (iii) infer deviations from both 
Hardy- Weinberg Equilibrium (HWE) and Linkage Disequilibrium 
(LE) (10 000 dememorisations, 100 batches, 5000 iterations per 
batch); (iv) estimate gene flow (jV^m, effective number of migrants 
per generation) via the private allele method of Slatkin [57]; (v) 
5 investigate the partition of the STR diversity within and among 

= o" populations by amova; (vi) infer the degree of genetic differenti- 

_g ^ ation among populations by estimating the average Fg-p distance 

§_ S values. Bonferroni correction [58] was adopted to adjust the 

^ 3 significance level of each test. The average i^sx distance values 

^'Z were plotted on the first two axes of a Principal Component 

Analysis (PCA) using siatistica 5.0/W (Statsoft Inc., USA). 

Population genetic structure. Bayesian clustering analysis 
was performed with structure v. 2.3.4 [59] to investigate the 
spatial structure of the genetic diversity. We focused on identifying 
the K (unknown) clusters of origin of the sampled individuals and 
g.i to simultaneously assign them to each cluster. We assumed 

correlated allele frequencies and we used a prior population 
I 2 information option to take the sampling locality into account [60] . 
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iterations, following a burn-in period of 10'^ iterations, and were 
replicated five times per each A'-value (1 to 12). The number of 
clusters that best fitted to the data was chosen using the formula of 
Evanno et al. [61]. An identification threshold to each cluster was 
selected (Qj = 0.80) as in [62]. 

Sex-biased dispersal. In order to test for possible differences 
in the dispersal rate between males and females, fstat was used to 
calculate five different parameters: i^is, ^sx, relatedness (R), mean 
[mAIc) and variance (M/c) of the assignment index [AIq) within 
each sex [10]. The latter estimates the probability that a given 
genotype originates from the population where it was sampled, 
and the statistical significance is determined by a two-tailed test 
using 10 000 randomizations. Low mAIc and high vAIq, values are 
interpreted for the dispersing sex. While ^sx and R are expected to 
be larger in the philopatric than in the dispersing sex, the opposite 
occurs for Fi^,. Nevertheless, the power of these statistics depends 
_g '§ on dispersal rates, bias intensity, sampling design and the number 

1^ of loci [10]. 

< s, 

g :i Results 

I I Sexing 

J: I ^ The CHD gene was amplified in all barn swallows: 92 birds 

o g g were identified as males (single PGR product, ca. 200 bp) while 94 

Q- S .g 2 as females (two PGR products, ca. 200 and 240 bp). 

I — — ™ ^ 

o ^ 

sl^ 8 

S.| S ^ Population genetics. The alignment of 76 (60+16) ND2 

a; 1^ g- P- sequences produced 31 haplotypes (H1-H31: GenBank accession 

fs.^ £ codes HF548562-HF548592) including 27 polymorphic sites. 

S ij- Estimates of all parameters are summarized in Table S2. SPA 

^ ^ £ ^ and IRL populations showed the lowest number of haplotypes as 

O 5" cc 5 well as the lowest values of haplotype diversity (Ji), average number 

^ S g ;5 of pairwise differences [Hj and nucleotide diversity (n). The highest 
number of haplotypes was found in GYP and MED. The Median 
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Joining network showed that all barn swallow populations were 
genetically admixed with no noticeable divergence among 
haplotypes (Figure 2). In particular, two haplotypes (HI and H4) 
were common to all populations. The 88.4% of the mtDNA 
variability was partitioned within populations while the 11.6% 
among them (PhisT = 0.131, P<0.001: data not shown). Only 
MED significantly diverged from aU other populations (Phisx 
range: 0.28-0.38, aU P<0.001, Table S3). When amova was 
performed either without KRD and MED populations or 
excluding MED only, we found that the partition of the mtDNA 
variability among populations decreased to 1.86% and 1.33%, 
respectively (Phisx = 0.02 and 0.01, respectively, P>0.05: data not 
shown). Finally, when all the estimated parameters were computed 
by including in the analysis males and females separately, all 
results matched those produced using the entire dataset (data not 
shown). 

Historical demography. With the sole exception of the SSD 
test {P= 0.022), the .stati.stics we used {D= -2.047, Fs = -56.028 and 
^^2 = 0.040: all P<0.001) could not exclude the occurrence of a 
demographic expansion for the entire sample size (n = 76) (Figure 2). 
Indeed, the beU-shaped (r= 0.088) curve obtained for the Mismatch 
Distribution (MD) of the pairwise differences suggested a rapid 
population growth. When the demographic parameters were 
computed using males (n = 27) and females (h = 33) (GenBank entries 
excluded) separately, males showed exactly the same trend 
as the whole dataset (D = - 1 .970, i^s = - 1 3. 1 78 and R2 - 0.056: all 
P<0.001; SSD test: r= 0.158, P= 0.023), while for the females a 
demographic expansion was suggested by all statistics (i)= —1.571, 
i?s = -18.449 and i?2 = 0.064: aU P<0.001; SSD test: r=0.123, 
P= 0.063). The McDonald-Kreitman test was not significant for the 
whole sample size (Fisher exact test, P= 0.16). 



Microsatellite DNA: all Individuals 

Genetic variability. All STR loci employed were highly 
polymorphic. In the entire sample size («= 186) the STR panel 
was powerful in discriminating individuals (Pid = 8. 16 x 10 and 
PiDsib = 3.29xl0"^ Table 1), as values lower than 0.001 can be 
considered as satisfactory [53]. Micro-checker did not provide 
evidence for allele dropout or scoring errors due to stuttering, 
although three loci (Hir4, Hir7 and Hir24) showed an excess of 
homozygotes for most of the aUele-size classes, thus pointing to the 
possible presence of nuU alleles (data not shown). The total number 
of alleles at each locus ranged between 6 and 25 (Hir5 and Hir7, 
respectively), with a mean of 13.4 alleles per locus (Table 1). 

The average values of Hq were smaller than for each locus 
(Fisher exact test, P<0.001 aU loci, Table 1) and ranged between 
0.39 and 0.80 (Hir4 and Hir6, respectively: Table 1). There was no 
evidence of EE at any pair of loci after sequential Bonferroni 
correction (P>0.05, all comparisons: data not shown). Both the 
number of alleles and gene diversity of each locus pointed to a very 
high degree of genetic variability. Allelic richness ranged between 
7.9 and 9.9 (SPA and GYP, respectively: Table 2); GYP showed 
the highest number of private alleles {n = 8, Table 2). Average 
levels of Ho and across all loci and populations were relatively 
homogeneous and showed very restricted ranges (//q- 0.60-0.67, 
H^: 0.79-0.82, Table 2). Ho values pointed to a deficiency of 
heterozygotes in all populations, which was confirmed by 
departure from HWE (P<0.001 aU loci, Table 2). One locus in 
IRE, two loci in SPA, three loci in GER and RUS, and four loci in 
ITA and GYP were not in HWE (P<0.001, data not shown). 
However, we found that no locus deviated from HWE in all 
populations, and when the Bonferroni's correction was taken into 
account only three loci (Hir4, Hir7 and Hir24) deviated from 
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Figure 2. Median-Joining network of barn swallow populations computed on mtDNA haplotypes with network. Size of circles is 
proportional to the haplotype frequency. The colour of each population is indicated as well as the number of each haplotype. A length bar to 
compute the number of mutational changes was provided (see also Table SI ). The inset showed Mismatch Distribution (MD) of the mtDNA pairwise 
differences (observed: dotted; expected: line) computed on the whole dataset (n = 76). The expected curve was obtained from simulated values 
computed from the data under the model of demographic expansion (Hq). The Harpending's raggedness index (r) was given with the P value of the 
SSD test as well as Tajima's D, Fu's Fs and R2 statistics. 
doi:1 0.1 371/journal.pone.0098574.g002 
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HWE in some populations yet not in all (P<0.001, data not 
shown). 

We found that 98.6% of the total STR variability was 
partitioned within populations and 1.38% among them 
(^'sT = 0.014, P<0.001). In the PCA plot reported in Figure 3A, 
the first two components explained the 82.8% of the total 
variability, SPA being the most diverging population (e.g., versus 
CYP and RUS, ii'sT = 0.026 and 0.027, respectively: P<0.001). 



Non significant Fsi distance values were found among ITA, GER, 
CYP and RUS (gene flow range: 3.87-6.07, Table 3). 

Population genetic structure. In Figure 4A the whole 
sample size was taken into account. Bayesian clustering analysis 
indicated that barn swallows could be divided into two genetic 
groups {K= 2, see also Table 84). Genetic differentiation was 
strong between SPA and all other populations, and Spanish 
individuals were mosdy assigned to the cluster II (Qj = 0.16; 
Qji = 0.84). While ITA showed the highest number of barn 
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Figure 3. The Principal Component Analysis performed using the average pairwise Fs^ distance values among STR genotypes. The 

percentage of total variance explained by each of the first two components is given. (A) All individuals. (B) Only males. (C) Only females. 
doi:1 0.1 371/journal.pone.0098574.g003 
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swallows with admixed genotype (n = 1 2: Qj = 0.59 and Qp = 0.41 , 
see Table S4), a slight differentiation was found between (IRE + 
GER) and (GYP + RUS) population pairs, yet their individuals 
were all assigned to the cluster 1 (Q[ range: 0.82-0.94). When we 
excluded from the Bayesian clustering analysis the STR loci 
showing null alleles and deviating from HWE (Hir4, Hir7, and 
Hir24: see above), barn swallows were assigned to two genetic 
groups and results matched those obtained when the entire set of 
loci was taken into account (data not shown). 

Microsatellite DNA: males versus females 

Genetic variability. Average levels of Hq and across all 
loci and populations were relatively homogeneous and showed 

deficiency of heterozygotes in both male and female barn swallows 
(Fisher exact test, P<0.001 all loci. Table 2). IRE and CYP 




Figure 4. Bayesian admixture analysis as inferred using struc- 
ture. The AK calculated according to Evanno et al. [61] was optimal for 
K=2 in all computations. Each population was represented by a pie 
chart whose segments were proportional to the number of specimens 
assigned to cluster I (black), to cluster II (white) or which showed 
admixed genotypes (grey). Threshold value for assignment to each 
cluster was Qi = 0.80 fTable S4). (A) All individuals. (B) Only males. (C) 
Only females. 

doi:1 0.1 371/journal.pone.0098574.g004 



showed the highest number of private alleles in males and females, 
respectively (Table 2). Females showed global -Fgx values higher 
than males in all populations (on average, females: Fgx = 0.022, 
P<OM 1 ; males: F^^ = 0.0 11, P= 0.003; Table 2). The majority of 
the total STR variability was partitioned within populations in 
both males and females (98.2% and 97.8%, respectively: data not 
shown). Differences between sexes were found in the PGA plot of 
average i^sT distance values. Males were highly differentiated 
along the 1" component {77.0% of the STR di\'ersity: Figure 3B), 
while in the females the 2'"' component explained a significant 
portion of the total variability (26.I'X>: Figure 3C). In mal(;s, SPA 
was the only divergent population with Fg,-^ values ranging from 
0.022 to 0.050 {versus ITA and GYP, respectively, P<0.05: 
Table 3). Significant F^j- distance values were disclosed among 
other pairs of populations in females but not in males (Table 3). In 
females, the largest genetic distance values were found between 
IRE and ITA (0.034), IRE and CYP (0.036) and IRE and RUS 
(0.048) (all P<0.05, Table 3), whereas no genetic differentiation 
was disclosed among ITA, GYP and RUS as well as among IRE, 
SPA and GER. Gene flow was, on average, higher in females than 
in males (average jY^.m: females = 2.92, males = 2.45, Table 3). 

Population genetic structure. Bayesian clustering analysis 
performed in males and females separately suggested the 
occurrence of two groups {K= 2, Figure 4B, 4G; see also Table 
S4). For each population, males and females showed a largely 
different genetic make-up that could be inferred only partially 
when the entire sample size was included in the analysis (see 
above). When we used only the genotypic data inferred from the 
males, SPA di\ erged from all other populations and showed the 
highest membership value to the cluster II (Qj = 0.21; Qji = 0.79, 
Figure 4B). No similar evidence was observed in the other 
populations: in these latter, most of the individuals were assigned 
to the cluster I (Q; range: 0.75-0.92) and only a very few were 
assigned to the cluster II or showed admixed genotype (Figure 4B, 
Table S4). When we used only the genot)'pic data inferred from 
the females, two different groups were disclosed. One included 
SPA, IRE and GER, which showed the highest assignment values 
to the cluster I (Qj = 0.67, 0.84 and 0.70, respectively: Figure 4G). 
The other comprised ITA, GYP and RUS, which showed the 
highest assignment values to the cluster II (Qji = 0.83, 0.89 and 
0.78, respectively: Figure 4G, Table S4). When SPA was excluded 
from the Bayesian clustering analysis, the pattern showed by the 
females was very similar to that inferred using all populations 
(Figure SI versus Figure 4C): in ITA, CYP and RUS the majority 
of the individuals were included in the black cluster, while IRE 
and GER included many birds assigned to the white cluster. On 
the contrary, when the SPA birds were excluded, a not negligible 
level of genetic diversity came to the fore among the males (Figure 
SI versus Figure 4B), and pointed to the occurrence of three genetic 
groups (IRE + GER, ITA + CYP, and RUS), although the F^t 
distance values were not significantly different among each other 
(not shown). 

Dispersal. Males and females were tested for sex-biased 
dispersal using either all populations separately or keeping SPA on 
its own and grouping all of the other ones. In the first case, females 
showed higher inter-population Fis and vAIc values than males 
(females: i^s = 0.225, »^/c= 10.982; males: i?is = 0.217, 
vAIc = 8.829, Table 4). However, these differences were not 
statistically significant (all parameters). In the second case, Fgx 
(females: Fst = 0.004; males: Fst = 0.028, P= 0.015) audi? values 
(females: R = 0.007; males: = 0.045, P= 0.0 1 7) were significantly 
different and suggested female-biased dispersal. This finding was 
supported further by higher values of Pis in the females than in the 
males as well as by the negative value of mAlQ in the females 
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Table 4. Sex-biased dispersal tests. 







All populations 






SPA versus all others 




Males 


Females 


P -value 


Males 


Females 


P -value 




0.217 


0.225 


0.389 


0.217 


0.234 


0.243 


Fsv 


0.011 


0.015 


0.763 


0.028 


0.004 


0.015* 


R 


0.017 


0.025 


0.752 


0.045 


0.007 


0.017* 


mAlc 


-0.052 


0.051 


0.594 


0.041 


-0.040 


0.444 


vAlc 


8.829 


10.982 


0.147 


15.580 


11.110 


0.898 



^is« f^sTf relatedness (/?}, mean assignment index [mAlc) and variance of mean assignment index {vAlc) were estimated on males and females considering either single 
populations or by separating Spanish barn swallows from all other ones. Significance (P) {two-tailed test) was assessed using the randomization method of [10]; *, P< 
0.05. 

doi:1 0.1 371 /journal.pone.0098574.t004 



(i^LS = 0.234, mAIc = —0.040); however, these difTerences were not 
statistically significant (all parameters. Table 4). 

Discussion 

There are a few studies focusing on migratory bird species and 
relying on the use of molecular DNA markers (e.g., sandhill crane 
[63]; black-throated blue warbler [64]; reed warbler [65]). This 
likely occurs because species with elevated mobility are expected to 
show a higher level of gene flow and a weaker genetic structure 
than sedentary ones [66-68]). However, none of these studies 
accounted for sex in the genetic analysis. Only Ortego et al. [12] 
investigated the correspondence between population genetic 
structure and natal dispersal by analyzing males and females 
separately but in a non-migratory passerine (Cyanistes caeruhm, blue 
tit). On the contrary, studies focusing on other vertebrate species 
with sex-biased dispersal and taking into account the sex of 
individuals in the genetic analysis are known [16,17,69]. In the 
barn swallow, the use of either mitochondrial or microsatellite 
DNA pointed to the admixed genetic structure of the European 
populations investigated so far [23,24,26], and significant genetic 
(mtDNA) distances were found only among barn swallows of 
different continents [33]. Nevertheless, in this species, the genetic 
pattern of males and females has never been compared. 

Mitochondrial DNA 

The mtDNA analysis did not suggest any significant structuring 
of the genetic variability (Figure 2), as it might be expected in a 
species with female-biased dispersal and male philopatry [69] . The 
star-like shape of the network, which included two ancestral 
haplotypes (HI and H4), as well as the bell-shaped curve of the 
MD were consistent with a recent demographic expansion of 
the studied barn swallow populations (Figure 2, Table S2) 
[26,33]. Such a scenario was statistically well supported (Fig. 2: 
Z)= -2.047, i?s = -56.028 and = 0.040, all P<0.001), with the 
SSD statistic {P= 0.022) being an exception. However, although 
all tests we employed may be sensitive to unknown structure within 
populations, [70] stressed that the SSD statistic was actually the 
less powerful. Furthermore, when only the females were analyzed, 
a model of expansion was supported also by the SSD statistic 
(P= 0.063). Overall, we felt confident in considering that such a 
deviation from neutrality was very likely due to demographic 
changes rather than selective processes, as the McDonald-Kreit- 
man test was not significant in the entire dataset [71]). This result 
is in agreement with the relatively recent bam swallow range and 
demographic expansion due to the proliferation of human 



settlements providing widespread availability of suitable nest sites 
[33]. 

The 11.6% of total mtDNA variability was partitioned among 
populations, a value much higher than that (<2%) found by Dor 
et al. [24], who analyzed populations of H. r. rustica and H. r. 
transitiva (Middle East). However, we feel confident that this result 
was due to the divergence of MED, the easternmost and 
northernmost population in our sampling scheme. Indeed, when 
MED was excluded from the analysis the mtDNA genetic 
structure prompdy disappeared (Figure 1, Table S3). 

Microsatellite DNA: general overview 

The loci of the microsatellite DNA investigated in this study 
showed a pattern of variability similar to that reported by [47,48], 
with very high degree of polymorphism and low level of 
relatedness among the genotyped barn swallows (Table 1). 
MicrosateUites did not show any evidence of Linkage Disequilib- 
rium. However, a significant departure from HWE was found in 
all populations due to a deficiency of heterozygotes (Table 2). This 
result was possibly caused by either the occurrence of nuU alleles at 
a few STR loci or the Wahlund effect [72], which, in turn, pointed 
to sub-structuring due to sourcing from different populations with 
different allele frequencies [73]. As discussed by [63], who studied 
migratory sandhill cranes {Cms canadensis), a deficiency of 
heterozygotes may be common in migratory species when a given 
population consists of local and immigrant individuals with 
different origin. Overall, departures from HWE would seem also 
a deficit intrinsic to the barn swallow, which, however, do not 
necessarily affect the result of the genetic analysis. Indeed, 
Bayesian procedure does not require perfect equilibria to cluster 
individuals, yet it attempts to minimize such departures within 
groups [73]. Rodriguez-Ramilo et al. [74] evaluated the accuracy 
of some Bayesian clustering methods when both Hardy-Weinberg 
and Linkage Equilibria were not fuUy respected. They found that 
STRUC TURE could reliably determine the correct number of clusters 
also for i*sT values as low as 0.01; hence, we did not exclude any 
STR locus from our analysis. Nevertheless, we also showed that 
the output of STRUCTURE did not change when Hir4, Hir7 and 
Hir24 loci were ruled out (see Results). 

Microsatellite DNA: males -i- females, males only, females 
only 

Santure et al. [23] and Dor et al. [24] investigated H. rustica 
including European populations with different morphology and 
migratory behaviour. Nevertheless, both mitochondrial and 
microsatellite DNA markers did not disclose any significant 
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genetic structure. By contrasting an equal number of males and 
females from six localities across Europe, we show significandy 
more population structure in males than females. This difference 
in structure can be explained by dependence of dispersal on sex 
(Figure 3, 4; Tables 3, S4). Different from [23], in our study the 
pairwise F^t computations as well as the PCA and the Bayesian 
clustering analysis revealed a genetic picture never before 
reported. While we cannot exclude that the discrepancy between 
the two studies could be due to the different distribution of the 
sampling sites, it should be noted that Santure et al. [23] used a set 
of STR markers (6 loci) smaller than that ( 1 0 loci) employed in the 
present study, and did not provide sex ratio of the investigated 
sample. Hence, it seems likely that our analysis has more power to 
detect any genetic structuring of populations than that performed 
by [23]. Overall, the PCA carried out by using the average 
pairwise -FsT distance values computed among all populations as 
well as the Bayesian clustering analysis (males + females), pointed 
to the strong divergence between SPA and all other populations 
(Figure 3A, 4A, Table 3). This result was even more evident when 
we used only the male genotypes (Figure 3B, 4B, Table 3). For 
instance, in the Bayesian analysis, most of the SPA individuals 
grouped together in the cluster II, whereas the males from all of 
the other populations were mainly assigned to the cluster I 
(Figure 4B). Balbontin et al. [27] obtained similar results in a field 
study. They investigated long-term trends in natal dispersal of 
northern (Denmark) and southern (Spain) Europe barn swallow 
populations. They found female-biased natal dispersal in both 
populations and male phikipatr)- six times higher in the Spanish 
than in the Danish population. In our study, the genetic 
differentiation showed by the SPA population could be due to a 
particularly high rate of natal phUopatry of males. We would like 
to stress that in the bam swallow the choice of the first breeding 
site is crucial, as it can determine where an individual wiU 
reproduce for the rest of its life. Balbontin et al. [27] suggested that 
the probabilit)' of philopatry, which is related to the fitness in terms 
of longevity, may depend on ecological factors related to the 
breeding site, although both the livestock farming and the 
architecture of rural buildings could also influence the choice 
between philopatry and dispersal [75]. Compared to the Spanish 
population, the lack of differentiation among all of the other ones 
could be due to the lower natal philopatry of their males. These 
populations, a mix of resident (phUopatric) and immigrant 
(dispersing) individuals, showed reciprocal small genetic distances 
(Figure 3B, Table 3), and suggested that the barn swallow's 
dispersal behaviour should be regarded as a rather plastic trait 
[27]. However, when the SPA population was excluded from the 
Bayesian clustering analysis (only males), a not negligible degree of 
genetic differentiation was disclosed across Europe (Figure SI 
versus Figure 4B). Although this result could seem related to the 
occurrence of differences in the migratory routes as we have 
suggested for the females (see next paragraph), we feel more 
confident in stating that male philopatry in each population is the 
primar}' c:ause for the population genetic structure we ha\ e found. 

Genetic structure and migratory behaviour (males versus 
females) 

When only the females were taken into atxxjunt the genetic 
scenario was strikingly different compared to that inferred using 
the males or the entire sample size. The average -Fst distance 
values (Table 3; with related PCA of Figure 3C) as well as the 
Bayesian clustering (Figure 4C) marked out two groups of females: 
one (central to western Europe) included SPA, IRE and GER, the 
other (central to eastern Europe) comprised ITA, CYP and RUS. 
The genetic distance between the two groups was significant 



(^ST = 0.020, P<0.00 1 , data not shown). Admixed genotypes were 
more frequent and gene flow level higher in the females than in the 
males (Figure 4, Tables 3, S4), this pointing to a higher dispersal 
rate and a lower philopatry of the first group compared to the 
second (female-biased dispersal) [4,8,9,27]. 

When the clustering of the females was considered the 
occurrence of a significant latitudinal component could not be 
ruled out (e.g., compare SPA versus IRE and CYP versus RUS). 
This, in turn, suggested that gene flow among populations could 
be partially influenced by the axis of migration. However, the 
divergence between ITA and GER also suggests that dispc-rsal 
does not strictly occur along a North-South axis. On the other 
hand, the genetic difierentiation between eastern and western 
populations resembles what has been observed for other passerine 
species showing a clear migratory divide in central Europe [19,76]. 
While clear evidences for the existence of a migratory divide do 
lack in the barn swallow, ringing data indicate that the autumn 
migratory routes of eastern and western populations of this species 
are different. Barn swallows from western Europe head for the 
Iberian Peninsula, while those of eastern populations tra\ (:l down 
throughout the eastern Mediterranean and the Middle East. 
Again, barn swallows from central Europe may travel South 
straight across the Mediterranean or south-west to Spain [20]. 
This pattern of migration roughly parallels the results obtained by 
Ambrosini et al. [25] regarding the migratory connectivity in H. 
rustica. This analysis, which was carried out on a large dataset of 
ringing recoveries, produced weight for the existence of two main 
clusters: one includes birds breeding in south-west Europe and 
wintering in central Africa, the other comprises birds breeding in 
northern and eastern Europe and wintering in southern Africa. 
The genetic relationships disclosed in the present study, on one 
side, between RUS and CYP and, on the other, between SPA, 
GER and IRE, fit to the groups described by [25] and to the main 
migratory movements summarized in [20]. Furthermore, mark- 
recapture data provided by BirdLife Cyprus (A.C. Author pers. 
comm. 2012) also point to a single migration route between RUS 
and CYP. Whereas according to Ambrosini et al. [25] ITA should 
be separated from the RUS and CYP populations, our results 
point to the occurrence of some genetic kinship among the females 
of these countries (Figure 4A, Table S4). Overall, this may either 
indicate that the pattern of gene flow is only partially related to the 
migratory movement or suggest a poorly documented migratory 
route that connects central Italy, north-west Russia and Cyprus. 
Actually, the geographic area of birds recovered in Italy or ringed 
in Italy and recovered abroad is huge and include several eastern 
and south-eastern countries (e.g., Turkey and Greece, [77]). These 
wide- connections, probably diu- to the position of the Italian 
Peninsula in the Mediterranean, should be further investigated by 
extending the number of the Italian sampHng locations in order to 
attempt to disentangle a very complex pattern of gene flow. 

When only the males were considered, the very high philopatry 
of the Spanish population forced the whole genetic scenario by 
isolating the latter against all of the other ones (see previous 
paragraph). On the contrary, when the SPA males were excluded, 
we disclosed slight differences among the studied populations that 
pointed to the occurrence of three groups: (IRE -I- GER), (ITA -I- 
CYP) and RUS (Figure SI). This pattern showed some degree of 
correspondence with the female clustering, thus suggesting that the 
mechanisms driving the direction of dispersal movements are 
probably the same in both sexes. Nevertheless, being aware that 
further investigations are needed, we suggest do not over- 
interpreting these results. 
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Sex-biased dispersal 

Cases of inconsistency between population structures inferred 
from mtDNA or from STR variability are well known. A greater 
population differentiation can be detected using nuclear rather 
than mitochondrial markers (e.g.: [78,79], this study) but the 
opposite may occur as well (e.g.: [71,80]). Although both natal and 
breeding dispersal may enable high connectivity among popula- 
tions [1,2,5,6], breeding dispersal is virtually absent in the barn 
swallow [27]. Hence, sex-biased natal dispersal represents the best 
explanation fitting the observed discordance, although tests for the 
whole dataset failed to obtain significance for all parameters 
(Table 4). However, as stressed by [10], their overall outcome must 
be discussed in light of the amount of combined factors that may 
affect the statistical power, namely the dispersal rate and the bias 
intensity, the sampling design and the number of investigated STR 
loci [16,81]. For instance, when the dispersal rate and the sex-bias 
are not strong vAIq seems the most powerful test, followed by ihAIq 
and FsT, which work better with less than 20 loci [10]. In this 
study, vAIq was higher in the females than in the males, thus 
pointing to the occurrence of female-biased dispersal. When the 
tests were performed and SPA was separated from all other 
populations, R and i^s i v alues were significantly lower in the 
females than in the males, mAIc value was negative (Table 4) and 
Fig value was lower in males than in females, thus suggesting high 
phUopatry for the Spanish males. 

In conclusion, we partially confirmed published data reporting 
the lack of a significant partition of diversity using mtDNA 
markers, while, in contrast to previous studies, we detected 
significant genetic structure by using nuclear DNA markers. The 
different scenarios observed between sexes could be explained with 
the non-random patterns of gene flow likely mediated by female 
natal dispersal and significant variability in male phUopatry among 
barn swallow populations. Our results emphasize the importance 
of taking into account the sex of sampled individuals to obtain 
unbiased results on species showing a different pattern of dispersal 
between males and females. 

Supporting Information 

Figure SI Bayesian admixture analysis as inferred 
using STRUCTURE performed excluding the SPA population 
in male and female barn swallows. AK was optimal for 
K= 2, all computations. Each population was represented by a pie 
chart whose segments were proportional to the number of 
individuals assigned to cluster I (black), cluster II (white) or which 
showed admixed genotypes (grey). Threshold value for assignment 
to each cluster was Q; = 0.80. 
(DOC) 

Table SI The sample size (n = 186) of this study and the 
mtDNA sequences downloaded £rom the GenBank 
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The number of male (M) and female (F) individuals genotyped 
with mitochondrial and STR markers was given (unavailable on 
line for MED population). GenBank accession codes for KRD and 
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Table S2 Estimates of mtDNA genetic diversity (aver- 
agez± SD) as computed for each population and for the 
whole dataset. Sites: number of segregating sites. 
(DOC) 

Table S3 Average mtDNA pairwise distance values 
(Phisx) as computed among all populations; *, P<0.05; 
**, P<0.001; others, P>0.05. 

(DOC) 

Table S4 Proportion of estimated membership of all 
populations to each cluster (Qj and Qji) as inferred by 
STRUCTURE with K=2 (Figure 4). Computations were per- 
formed using: (i) all individuals (« = 186), (ii) males (n = 92) and (iii) 
females [n = 94). The number of specimens assigned to the cluster I 
(hi) and II {tin) was reported as well as the number of individuals 
showing admixed genotype (re^ix). The threshold value for 
admixed assignment was Qj = 0.80. 
(DOC) 
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